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The recent trend in location-based services has led to a proliferation of 
studies in indoor positioning technology. Wi-Fi received signal strength 
indicator (RSSI) Fingerprinting and pedestrian dead reckoning (PDR) are the 
two best representatives from both approaches. This research proposed a 
genetic algorithm to combine Wi-Fi Fingerprinting and PDR. By taking 
advantage of PDR and genetic algorithm, we only need to collect a limited 
number of points for the fingerprint dataset with known coordinates, then 
target trajectories' position can be estimated with high accuracy. Results 
from our experiments and simulations have shown that even in the scenario 
of noisy inertial measurement unit (IMU) sensors data, using RSSI 
measurements and the coordinate of 8 points, our proposed method was able 
to achieve 1.589 meters of average distance error which is 34.4 percent 
lower than the conventional Fingerprinting method. 
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1. INTRODUCTION 

Indoor positioning (IP) is receiving much attention due to advancements in sensor technology and 
the high demand for location-based services. Large indoor environments like shopping malls, train stations, 
and airports are now trying to deploy internet of things (IoT) systems to improve user experience and quality 
of service. Thus, it opens the opportunity for indoor localization to be a core technology for user's navigation 
systems in such large indoor spaces. 

The global positioning system (GPS) is a popular method for outdoor navigation. However, its 
indoor performance drops significantly because of blockage in GPS signals when targets are inside a building 
or underground. Even outdoor, GPS accuracy could be affected by various factors, for example, atmospheric 
conditions and multipath [1]. Thus, many new emerging techniques for indoor positioning have been studied 
and proposed. Depend on different sensors technology and infrastructure, IP can be categorized into many 
approaches [2]-[4]. 

One popular approach that take advantage of inertial measuring unit (IMU) is pedestrian dead 
reckoning (PDR) [5]-[7]. Using signals from IMU sensors like accelerometer, gyroscope and magnetometer, 
step event [8], step length [9], [10] and heading angle [11] are extracted. Because most smartphones are 
equipped with IMU, PDR does not require indoor infrastructure to be set up. In addition, data is updated 
frequently, which enable real-time localization. However, in PDR, target's initial location must be known 
because current location is estimated based on the previous location. Furthermore, sensors in smartphones 
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and other mobile devices are subjected to noise, interference and tend to produce accumulated errors over 
time [12]. Thus, complex sensor fusion techniques were utilized to complement between sensors or integrate 
them with other wireless technologies like Wi-Fi or Bluetooth to improve accuracy [13]-[18], popular 
techniques namely Kalman Filter [19], Extended Kalman Filter [20], Madgwick Filter [17], and Particle 
Filter [21]. 

Another popular approach is to use Radio frequency, which includes a wide variety of wireless 
technology like Wi-Fi, Bluetooth [22], [23], Zigbee [24], Ultra- Wideband [25], radio frequency identification 
(RFID) [26]. Methods that utilize Wi-Fi is popular since most public or private indoor places have Wi-Fi 
access points (APs) installed and nearly every mobile device is equipped with a Wi-Fi transceiver module. 
Many researchers have tried to apply outdoor navigation techniques like triangulation, which is divided into 
two subgroups (lateration and angulation) [26]. The lateration is based on distance and is used in time of 
arrival (ToA) [27], [28] while angulation is based on direction and is applied in angle of arrival (AoA) [29], 
[30]. A well-known technique in Wi-Fi indoor positioning is received signal strength indicator (RSS 
fingerprinting [31]—[34]. In this method, the assumption was made that for every location in the environment, 
its Wi-Fi received signal strength indication (RSSI) measurements are unique and distinctive from others. 
Then, signals from those positions are recorded and stored in a fingerprint database called the radio map. 
After fingerprints are collected, researchers can start measuring the user’s Wi-Fi RSSI and match it to the 
similar point in the radio map to acquire user position. The step of constructing the database is also known as 
the offline phase and the matching step is known as the online phase. In online phase machine learning 
methods can be used to learn the pattern and produce more accurate results. 

In comparison with other indoor positioning techniques, Wi-Fi fingerprinting precision, deployment 
cost, and availability are superior. It can be applied to nearly every indoor scenario where a Wi-Fi network is 
installed. However, there are certain drawbacks associated with the use of Wi-Fi fingerprinting. First, Wi-Fi 
RSSI is subjected to multipath and shadowing interferences in a noisy environment [35]. Thus, the 
measurements are no longer stable and reliable. Secondly, the offline phase requires intensive fingerprint 
collecting which consume a lot of time and effort, especially in a large environment. In addition, adjustments 
in the interior of the environment make the radio map incorrect and need to be recollected. 

In this paper, the problem of Wi-Fi Fingerprinting is addressed by proposing a unique method to 
combine PDR and Wi-Fi Fingerprinting using the genetic algorithm. The proposed method would reduce 
greatly the amount of data needed to be collected in the offline phase while achieving a high level of 
accuracy. Using only a small number of labeled RSSI measurements as anchors, user track’s coordinates can 
be estimated by utilizing RSSI and motion from the user’s mobile device. Experiment and simulation have 
shown that the proposed method using the support vector machine model achieved an average distance error 
of 1.589 meters in the situation of motion error reaching 30%. In addition, only 8 anchor points were used, 
and the result compared with conventional Wi-Fi fingerprinting is 34.4% lower. 


2. LITERATURE REVIEW 

This section discussed the existing indoor positioning methods. We mainly focused on methods 
without offline fingerprinting map and analyze their advantages along with drawbacks. In [36] Jang and Kim 
classified methods in this approach into three categories, which are simultaneous localization and mapping 
(SLAM); inter/extrapolation; and crowdsourcing-based technologies. 


2.1. Simultaneous localization and mapping (SLAM) 

SLAM is a technique that allows researchers to construct the map of the landscape and estimate 
location of the target simultaneously without prior knowledge of the environment. The principle of SLAM is 
to use the previous target’s locations, control inputs and landmark observation as the condition for the joint 
probability distribution of target’s state and landmark location at current time. Ferris et al. in [37] proposed 
Wi-Fi-SLAM, which use Gaussian process-latent variable model (GP-LVM) to transform the 3 dimensional 
data to 2 dimensional coordinate. Instead of using fixed measured data from AP, Brian used them as latent 
variable then probabilistically modeled the relationship between the latent variable and the observed data 
from target. The relationship can be recovered using the optimization of the marginal likelihood. Due to the 
high time complexity of GP-LVM, it would be more suitable for a robot equipped with a dedicated computer 
than a mobile device. In addition, GP-LVM tend to replace curves in the ground truth with straigh lines in the 
produced map. GraphSLAM in [38] was proposed to overcome the time complexity and the dependence on 
signal-rich environments of GP-LVM. Authors of GraphSLAM assume that in any small and free space, the 
intensity of propagated radio waves are almost equal, this allows GraphSLAM to be applied in a variety of 
environment. In addition, GraphSLAM achieved O(n‘2) asymptotic runtime by omitting the whitening 
transform on the weighted observations and their weights. However, while applying GraphSLAM to large 
space, it would need to be interpolated into smaller area because of the signal strength constraint mentioned 
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earlier. The interpolating process, if is not done correctly could affect the performance. WiSLAM in [39] is 
the integration of two method which are PlaceSLAM [40] and FootSLAM [41]. PlaceSLAM performs indoor 
localization using RSS values and the location of user is associated with the proximity information. In 
contrast to using proximity information or landmarks, FootSLAM utilizes probabilistic motion model over 
space to construct the map of the environment. Since FootSLAM uses IMU, authors countered sensor error 
by Rao-Blackwellized particle filter (RBPF) [42]. 


2.2. Extrapolation and interpolation 

Triangle interpolation and extrapolation (TIX) [43] creates mapping curves by using at least 3 
access points (APs) to measure receive signal strength (RSS) between themselves. Then, server can estimate 
the distance between target and each AP using the mapping curve and RSS measurement from target’s 
device. A technique called proximity in signal space (PSS) is used to choose the suitable mapping curve. 
Major drawbacks of this method are the reliance on APs as their location need to be known beforehand and 
TIX also need at least 3 APs to perform. Furthermore, when the number of APs increases, the computational 
cost also grows. Signal distance map (SDM) in [44] is quite similar to TIX because they both use RSS to 
locate client position and attain the mapping function using inter-AP measurements. The main difference 
comes from a more complex algorithmic computation that author used in SDM. Although the result of SDM 
is better than TIX, the trade off is the increase in calculation time of SDM. 


2.3. Crowdsourcing and crowdsensing 

While other techniques focus on exploiting the wireless network infrastructure and _ sensors, 
crowdsourcing and crowdsensing takes an approach which take advantage of the crowd wisdom to make 
decision. User contribution is the most important factor as it affects the performance of a crowdsourcing and 
crowdsensing system. LocateMe in [45] utilizes and captures the magnetic signatures in hallways while person 
is walking in the environment. Those signatures are stored in a database similar to fingerprinting database and 
when a new magnetic signature is collected, it will be classified to one of the fingerprints in the database. The 
main disadvantage of this method is that magnetic measurements is often inconsistent because of the existence 
of magnetic material, metal and also electronics devices in the environment. QrLoc in [46] is a crowdsourcing 
method which substitute the data collection of fingerprinting method by having user scan QR codes attached to 
the landmarks to mark his position and record the Wi-Fi measurements. Walkie-Markie [47] is a method which 
based on crowdsourcing to create the map of an indoor environment by using user’s trajectory acquired from 
IMU sensors and Wi-Fi landmarks. Wi-Fi landmark is refered as RSS trend tipping point (RTTP), where signal 
strength shift from rising to falling. When users move around the environment, mobile device would calculate 
stride length and direction using motion data while simultaneously scans Wi-Fi signal strength to detect Wi-Fi 
landmarks. The Wi-Fi landmarks can be created by different tracks from different users and the server would 
cluster those landmarks into one landmark using clustering algorithm. 

Based on the literature review of several indoor positioning method without offline survey 
fingerprinting, it reveals the strengths and limitations exist in each one of them. Although SLAM doesn’t 
need information about the environment, it requires observable landmark to locate itself. In addition, the 
reliance on probabilistic model make SLAM perform heavy computation which hinders its ability to run on 
mobile devices. Extrapolation and interpolation may solve the problem of time complexity of SLAM, but this 
method require several condition and restriction like knowing the location of APs or the minimum number of 
APs for the method to perform. With crowdsourcing, the idea to use the participation of user is a great 
solution to eliminate labor intensive data collecting procedure and this idea was utilized in our proposed 
method. In addition, the proposed method aim is not to build the map of the environment and it relies on 
genetic algorithm instead of probabilistic model to determine user position. As the result, our method is 
simpler to be implemented and doesn’t require specific knowledge of APs to perform. 


3. RESEARCH METHOD 
3.1. Overall system architecture 

The proposed method architecture is divided into two main phases, which is Data preparation and 
Coordinate optimization as illustrated in Figure 1. At first, on the leftmost, the Wi-Fi access points network 
are set up in the environment and transmit signals to devices denoted as mobile phone. In this step, the 
collected data is separated into two groups, RSSI measurement points with coordinate denoted as anchors and 
RSSI measurement points plus motion data without coordinate denoted as user data. Anchors would then be 
used as the testing dataset while motion data is utilized by genetic algorithm to generate an estimation of 
user’s coordinate. Those coordinates combine with RSSI measurement points from user data serve as the 
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training dataset for machine learning models and error is evaluated using the anchors dataset mentioned 
earlier. If the calculated error satisfies the convergence condition, user’s estimated coordinates are selected. 


Data preparation Coordinates optimization 


User data collecting Genetic Algorithm 


Possible track Genetic Best track 
coordinate operators coordinate 


Heading angle User 


Stride length anes a 
an 
Motion data Anchors —— coordinates 


Machine learning || Check converge 


? Anchors model condition 
Wifi Access coordinates 


Point: 


Figure 1. Proposed method system architecture 


3.2. Data preparation 

The anchor dataset K is collected in a similar way as the radio map of the fingerprinting approach. 
First, several points in the environment are selected to be anchors. Then a user with a small computing unit 
equipped with a Wi-Fi transceiver module will go to those designated points and record the RSSI 
measurements from access points (APs). The process is illustrated in Figure 2, the red markers represent the 
anchors and for each anchor, RSSI measurements from APs and anchors’s coordinate are recorded in form: 


K; = {(xj,y;), (RSSI}, RSSI?, RSSI?, ... RSSI}, ..., RSSI)}j € [1, Nel, i € [1M] (1) 


Where: 

Nx is the number of anchors point in dataset K 

M is the number of access point 

RSSIj. is the RSSI measurement from AP, of anchor point i*” 
(x;, y;) is the coordinate of anchor point i” 


{(xy, yw), (RSSI}, RSSIZ, RSSI}, ... RSSIK, ..., RSSIHY} 


Figure 2. Anchor points collecting procedure 


User dataset U includes RSSI measurements and motion data of users while they are moving. Users 
are required to have devices equipped with Wi-Fi module and IMU sensors in this case. For simplicity, an 
PDR algorithm is assumed to be used to extract heading angle and stride length from user motion data. 

In Figure 3, a user moving along a track marked as blue. RSSI measurements points are marked as 
black on the trajectory. On the user trajectory, straight line between RSSI measurement points and angle form 
by 2 consecutive vectors were used to represent the result of PDR algorithm on motion data. For each RSSI 
measurement point in dataset U, the data is in the form: 
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U, = {(RSSI}, RSSI2, RSSIZ, ...RSSIj, .., RSSI), (a, di}, k € (1, Null (2) 


Where: 

Ny is the number of data point in dataset U 

M is the number of access point 

RSSIi. is the RSSI measurement from AP, of point k“" 

a; is the angle formed by two vectors U; Waa and i eee 
d; is the Euclidean distance between U;, and Ux 


{(RSSI}, RSSIZ, RSSH, ... RSSHi, ..., RSS), 
(a1, d1)} 


Figure 3. User data collecting procedure 


In real cases, the proposed method does not have any knowledge of the user coordinate beforehand. 
But to perform validation of the method, the corresponding coordinate of RSSI measurement points in dataset 
U are collected. After that, the track are generated from the coordinate is denoted as, 


Treat = (Xx, Ye )} k € [1, Ny] (3) 


3.3. Proposed evaluation metric 

In dataset U, the angle and the distance between data points are known so the shape of user 
trajectory can be obtained. This is done by using PDR method, but since the initial location of the user is 
unknown, so the coordinate of the user track can not be obtained. Assuming that the coordinate of the user 
track is randomly generated and denoted as Tyeneratea = ((X» Yx)} kK € [1, Ny] with the condition that the 
shape of the generated track must remain the same as the result of PDR method. There will be multiple 
generated user track in different locations and the task is to select the track which is closest to the real track 
denoted as Tq, = {(%x, ¥x)}- To do this, a new metric is proposed to measure how close Tgeneratea and 
T;eat. First, the Tyeneratea is mapped to dataset U and used as training dataset for the machine learning 
model, then the dataset K is used as the testing dataset. The error is defined as the average Euclidean distance 
between the predicted coordinate and the coordinate in dataset K, it is calculated, 


7 V@i-%)2+01-Fi 


NK 


Err = (m) i € [1, Nx] (4) 


Where: X; and ¥; is the model predictions of x; and y; in dataset K. 

Conventionally, calculated value from (4) show how well the trained model performs on testing 
dataset K. However, it is observed that this value is also reflects how good training dataset is. The idea is that 
if a machine learning model was trained using accurate training dataset, it would perform better than the one 
receives poor and inaccurate training data. Using this idea, many Tyeneratea Can be created and the one with 
the lowest Err will be selected. However, it’s worth mention that the selected track is not guaranteed to be 
the closest track to T;.g; because the evaluation is on dataset K. Therefore, the result depends largely on how 
good and accurate anchor dataset K is when they were collected. A simulation was carried out where anchor 
dataset K was collected and a predefined track T;.g; is simulated inside the experimental area, then 100 
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Tgeneratea tracks were randomly created with the same shape as T;-eq;. Besides the proposed metric, the real 
average distance error (closeness), denoted as Err,eq, , between Tyeneratea and T;-eq; is calculated using (5). 
The relationship between Err and Err;¢q; is illustrated in Figure 4. 


pale “(CeK—% ke)? + e-Te)” 


Nu 


ErTreal = (m)k € [1, Ny] (5) 


Where: 
X, and ¥, are the coordinate from Tyeneratea 
x, and y, are the coordinate from T;eg; 

It is clear that there is a linear relationship as when value from of Err increases, the real error 
ETr,eq, also grows. This proves that if the Tyeneratea iS far from the T;-¢q;, then the Err would be high and 
when Tyeneratea 18 close to the real track, Err would be low. As mentioned ealier that the closest track may 
not be obtained and by looking at the lower left of Figure 4, it can be observed that the track with the smallest 
Err;eq, Value has Err value a bit higher than it’s neigbors. However, this is not a serious problem as the 
difference between the track with the lowest Err,¢q; and the track with the lowest Err is not significant. If 
the track with lowest Err is found, it would always be neighbor with the track with the lowest Err;¢g;. The 
next section proposes a solution to generate multiple Tyenerated- 
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Figure 4. Relationship between Err;,.¢,, and Err 


3.4. Genetic algorithm 

Genetic algorithm (GA) is a family of computational models that was first introduced by Holland in 
1960 in [48] and it was applied to a wide range of optimized problems. Based on Darwinian evolution theory, 
the algorithm can be simplified into the following steps. At first, the initial population is generated, each 
individual in this population holds a unique chromosome. In other words, the population is the representation 
of the possible solutions, and every chromosome is the potential candidate for the optimized problem [49]. 
Next, to measure how good a chromosome is compared to others, a metric called fitness is used and 
calculated using the fitness function [50]. After fitness evaluation, individuals in the population are selected 
to generate the next generation. There are many available approaches for selection algorithms [51]. The 
simplest method is to select randomly but this method does not guarantee that the next generation would be 
better than the current one, so it’s rarely used in practice. Commonly used selection method namely roulette 
wheel selection (RWS), stochastic universal sampling (SUS) [52], linear rank selection (LRS) [53]. Then, 
crossover operator is used to generate the next generation. To choose an appropriate crossover operator, it is 
important to take into consideration the problem type and the chromosome encoding method [54]. Then to 
maintain the diversity of the successors and prevent premature convergence, a mutation operator is applied. 
Because the risk of ruining an individual when performing mutation is high, the mutation rate is usually set to 
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be small. In certain cases, the mutation rate would increase when population fitness has no improvement 
which is also known as adaptive mutation rate [55]. 


3.5. Proposed genetic algorithm 

Based on the steps mentioned earlier, a genetic algorithm is implemented with the flow chart 
illustrated in Figure 5. At initialization, genetic algorithm parameters are set up: population size, number of 
generations, crossover rate, mutation rate, individual index and generation index, population container P, 
environment bounds, and motion data. The population initialization is emphasized in Figure 5 as the 
coordinate is generated randomly to build the track and the position must be within a predefined boundary to 
be appended to the population. Then, when the number of individuals in the population has met the 
requirement, the loop of selection, crossover, and mutate will occur until the convergence condition is 
satisfied or the number of generations exceeds. Details of the implementation will be described in the 
following sections. 


(Xny Yny) 


Chromosome | (x1, y 1) | a9) | bes | eve) | $88 


Figure 5. Chromosome representation 


3.5.1. Chromosome encoding 

At first, a representation of an individual (chromosome) in the population is considered. An array 
with each element consisting of two floating-point numbers was chosen as the representation of coordinate to 
be the chromosome as in Figure 6. Using this form, genetic operators are easier to be implemented later. 


Init GA param: N,,Ng,ém.éc;3 9 = Li =1;F [9] tatoulare sites Os ELE TE Nel amet 


Environment bound: {[Xmin»Xmax |L¥min » Vmax 1} 
Motion data: {(a,d,), (a@2d2),...,(@, dx), «-, (ay, dy, )} 
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XE [Xmin »Xmax ) & Vx E [Ymin Ymax | 
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Figure 6. Proposed genetic algorithm flow chart 


3.5.2. Population initialization 

After chromosome encoding, the next step is to initialize the population. An important parameter 
needed to consider is population size [54]. Although there is no default number to use when selecting population 
size, major factors that need to be examined are the size of search space, how much processing capability is 
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available, and initialization constraints. Large searching space would need more generated solutions to perform 
faster, but there would be trade-off for the consumption of processing power. After trials, a fixed population size 
of around 100 throughout the process is found to be optimal. In addition, to prevent the generated solutions to be 
too far off the environment, an area under constraints where value will be bounded by the maximum and 
minimum coordinate value in the x-axis and the y-axis. If the track is out of bounds, it’ll be recreated. This 
ensures the solutions to be confined in the search space and the method would converge. 


3.5.2. Fitness evaluation 

Fitness value must indicate how well an individual performs compared to others. It can be estimated 
using the proposed metric in (4). However, it is better to keep the intuition that individuals with better 
performance will have higher fitness value, so the fitness value is defined in (6). A metric called fitness sum 
is also used to evaluate the performance of a generation and it is computed (7). 


i 
Err (Tj) 


FU) = € [1, Np] (6) 
Where: 

T; is the track of i*” individual. 

F(T;) is the fitness value of i"individual in the population. 

Err(T;) is the error calculated using (4) of the track of i” individual 

Np is the population size 


FS =)? F() (7) 


3.5.4. Selection 

We chose to use Roulette Selection, which is one of the most popular selection techniques. It 
ensures that the individuals with high fitness value would have a better chance of being chosen. The 
probability of the individual to be selected can be calculated as, 


F(Tj) 


N 
joi F (Ts) 


p(T) = i € [1,Np] (8) 


Where: p(T;) is the probability of track of i" individual being chosen? 


3.5.5. Crossover 

The crossover operator is formulated with an idea that an individual which has higher fitness would 
pass down more of its traits to the offspring. Therefore, a coefficient is defined to determine the amount of 
contribution of each individual of a parent. If chromosome A with fitness value F(T,) and chromosome B 
with fitness value F(T,) are selected as parents, then the coefficients of A and B are computed in (9). We 
denote the offspring chromosome as C and the coordinate of T¢ is calcualted in (10)-(12). 


coeff, = ne; coef fs = (9) 

xe = cof fi, * xe + cof fy * xe (10) 
ve = cof fa» yi + cof fs * yk (11) 
Te = {Wed K€ [1 Nu] (12) 


Where: 
xf, yg is the coordinate of k“" point in T; 
xf, yf is the coordinate of k point in T, 
xP, y® is the coordinate of k“” point in Tp 

The result of crossover operator is a track that lies between parent tracks as illustrated in Figure 7. In 
Figure 7 it is assumed that the track B has a higher fitness value than the fitness of track A so the offspring C 
would be closer to track B as (10)-(12) are used to calculate the coordinate of track C. As a result, the 
offspring would retain the best traits from its parent and pass it down to next generation. 
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3.5.6. Mutation 

It is better not to make big changes to the track position since the space where the experiment took 
place is not large, so mutation rate is selected to be a constant throughout generations. In addition, using 
constant instead of adaptive numbers would help the solution to converge faster and more stable. Compared to 
when the mutation rate increases, it would also increase the risk of ruining the individuals in the population. 


—e Track parent A 
—=— Track parent B 
—t— Track offspring C 


Figure 7. Illustration of result of crossover operator 


4. RESULTS AND DISCUSSION 
4.1. Experiment setups 

The experiment is carried out in the laboratory which is a room 10 meters wide by 10 meters long. 
The environment is set up with tables, chairs, computers, and other networking devices to mimic real 
scenarios. The Wi-Fi network is set up and 8 APs are scattered around the room with unknown coordinates as 
the proposed method doesn’t rely on the location of APs. 

To collect dataset K, 8 points are desinated in the experiment area as anchor points and they are 
marked as red stars in Figure 8(a). Then, a device equipped with Wi-Fi transceiver module and software to 
record Wi-Fi signal strength were used. The device was carried to each of the designated anchor point to 
measure the RSSI and send the measurements to another computer to generate dataset K. 

The user track is simulated by using real RSSI measurements and simulated motion data. 
Figure 8(b) shows the simulated user tracks with blue dots are RSSI measurements points. It can be seen 
from Figure 8(b) that the anchor points were omitted in the user track. In practice, users track RSSI 
measurement points and anchors can overlap. In this case, the performance of the proposed method was 
tested on tracks that contain no anchor points because when the machine learning model trains and validates 
on the non-overlapped dataset, the result of the model will be less biased. Earlier assumption that the PDR 
algorithm is used on motion data, the angle and the distance between RSSI measurement points in the user 
track are obtained. As the result, dataset U is collected. 

Later in the genetic algorithm, multiple trajectory coordinates for the RSSI measurement points in 
dataset U would be generated. If the algorithm runs without the boundary constraints, many Tgenerateq Would 
drift out of the experiment are. To optimize the proposed genetic algorithm, a constraint was set up to force the 
generated coordinate to be in the range of [—5, 15] (meter) in both axes. The reason for the bound to extends to 
negative coordinates is that out of bound tracks may not be useless in certain cases. In the proposed genetic 
algorithm, the crossover operator could find a track that lies between parent’s tracks. From this idea, the result 
of crossover between 2 tracks that far away and out of bound may still lies inside the area. As a result of the 
restriction, the population would be more diverse and potentially achieve faster convergence. 


4.2. Machine learning model selection and configuration 

Two popular regression models are selected which are Epsilon-support vector regression (SVR) and 
k-nearest neighbors regressor (KNN). They are used to evaluate the performance of our proposed method. 
The hyperparameter configuration for both models is shown in Table 1 and Table 2. 
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Figure 8. Experimental settings with, (a) designated anchor points and (b) User simulated trajectory 


Table 1. SVR and hyperparameter configuration 


Hyperparameter Value 
Kernel Radical basis function kernel 
Regularization parameter 1.0 
Epsilon 0.1 


Table 2. KNN and hyperparameter configuration 


Hyperparameter Value 
N (number of neighbors) 5 
Weight function Uniform 
Algorithm Select best from Ball-Tree, kd tree and 
brute 
Leaf size 30 
Metric Minkowski 


4.3. Results without motion error 

The performance of the proposed method was examined using simulated motion data without error. 
The genetic algorithm using 2 machine learning models with their configuration in the previous section and 
the results can be seen in Figure 9. Looking at Figures 9(a) and 9(b), it is apparent that KNN performs better 
than SVR with errors calculated using (5) are 0.9366 meters and 1.16865 meters respectively. The result of 
the proposed method is compared to the conventional fingerprinting method with machine learning where 
dataset K is used as the training dataset and model will predict the coordinate of RSSI measurement points in 
testing dataset U. The same machine learning models as well as the same configuration mentioned earlier for 
the method were used and the result can be seen in Figure 10. 

While SVR is more flexible and could represent complex functions, the result in Figure 10(a) is 
chaotic and unusable. Because there are only 8 training datapoint in dataset K, this causes the KNN model to 
perform with multiple overlaps like in Figure 10(b). The error calculated using (5) of SVR and KNN, in this 
case, are 2.42199 and 2.64846 meters respectively. In general, it is expected that if the conventional 
fingerprinting method is used on a small training dataset, the trained models would suffer from underfitting 
and produce unwanted results. 

Besides the accuracy and error of the results, the performance of the genetic algorithm was also 
measured. The number of generations it takes to converge and the fluctuation of fitness and error over each 
generation are considered. In addition to fitness sum in (7), the error sum is defined, 


N. 
ES = 2, Ertreai(T;) (13) 


Where: Err;¢q)(7;) is the real error calculated using Tyeneratea Of j th individual in the population and T,.q) 
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Figure 10. Results from using conventional fingerprinting method, (a) SVR, (b) KNN 


Looking at the error sum in Figure 11(a), a similar case can be seen, but this time SVR had an 
increase of error from 27-ith generation. Some factors can contribute to the fluctuation such as the harmful 
mutation that happened on the good individuals or randomness in the initialization. As can be seen in 
Figure 11(b), the fitness sum increases rapidly at first couple generation and from generation 10°”, the 
solution converged and KNN achieve higher fitness sum than SVR. 

The proposed method results depend on the accuracy of dataset K so the error in dataset K also have 
an impact. It may happen because of the mutation and the genetic algorithm doesn’t know the real error of 
the Tgeneratea SO there is a chance that it would select the track with a higher fitness value but having higher 
Err;eq,. However, as mentioned earlier that the algorithm ensures the selected track to be the closest track’s 
neighbors, so the accuracy only drops slightly. 


4.4. Results with motion error 

In the previous section, it is assumed that the results of the PDR algorithm do not have errors. In this 
section, the proposed method performance is evaluated when IMU sensors are affected by noise, drift, and 
bias and produce errors in the results of the PDR algorithm. The error of the distances and angles in the user 
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track is set to be in the range from 5 percent to 30 percent. The error was created by a random process that 
follows Gaussian distribution and then added to the distances and angles generated earlier. Using the 
proposed method with the same configuration of genetic algorithm and machine learning models as the 
previous section and the results when using the SVR model are illustrated in Figures 12(a) to 12(f) while the 
results of the KNN model are shown from Figure 13(a) to 13(f). 


et NN IN Ng eS | 


/ 
U 


400 30 

3 E 

~ 5 

E 300 % 25 

2 A 

72) vo 

= am 

2D 400 a 
15 

100 
ee ee ee ee 
“0O)©~—S od10—<“<i—=~CD 30 40 50 0 10 20 30 40 50 
i-th Generation i-th Generation 
(a) (b) 


Figure 11. Plot of error sum and fitness sum with respect to generation (N, = 50), (a) Error sum, 
(b) Fitness sum 


Figure 12. Results of the proposed method using SVR with different amount of motion error, (a) 5%, 
(b) 10%, (c) 15%, (d) 20%, (e) 25%, (f) 30% 
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Figure 13. Results of the proposed method using KNN with different amount of motion error, (a) 5%, 
(b) 10%, (c) 15%, (d) 20%, (e) 25%, (f) 30% 


As can be seen from Figure 12 and Figure 13, when the motion error increases, the deformation, and 
distortion of the user trajectory is also greater. If they are compared to the tracks using the conventional 
fingerprinting method, it is clear that there is less chaos in terms of user track shape and trajectory. In certain 
positions where motion error is not large, the shape of the user track can still be recognized. As the shape of 
the user track depends entirely on motion data and PDR algorithm, the only way to improve and counter the 
deformation is to use better sensors and apply a better PDR method. 

To be consistent, once the error was added, the proposed method using SVR and KNN was used. 
That explains why the shape of the track for every level of error when using SVR and KNN is the same. Due 
to the existence of random processes in the simulation, 20 independent runs were performed with the same 
configuration and the average of average distance error from 20 runs is calculated. This value is refered in the 
graph and later as average error for convenience, results are shown in Figure 14. 
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Figure 14. Plot of average error with respect to angle and distance error of the proposed method and 
conventional fingerprinting method 
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From Figure 14, it is clear that when angle and distance error increases, the average error also rises. There 
is an exception at 20% of angle and distance error that may be caused by big motion error generated by the random 
process or harmful mutation in genetic algorithm. In general, it’s expected that the user track average error is 
affected by motion error. Details can be seen in Table 3, it can be observed that even when motion error is 30%, 
the proposed method was able to achieve average distance error at 1.589 meters and 1.674 meters using the SVR 
model and KNN model. Compared to the conventional fingerprinting method mentioned earlier, the proposed 


method's average error is nearly 34.4% and 36.7% less, which is a significant improvement. 


Table 3. Detail average error of the proposed method with different amount of motion error 
Proposed method average error (m) 


Motion error (%) 


SVR KNN 
0 1.145 0.875 
5 1.108 0.917 
10 1.108 0.941 
15 1.219 1.043 
20 1.174 1.211 
25 1.404 1.257 
30 1.589 1.674 


The results are further analyzed by comparing the empirical cumulative distribution function (CDF) 
of the distance error of points in the user’s track. The distance error of points is defined as the Euclidean 
distance from the one point in Tgenerateq and the corresponding point in T;¢q;. Results can be seen in 
Figure 15(a) and Figure 15(b). In Figure 15(a), it can be seen that distance and angle error affect the 
distribution of distance error of points in the user’s track. When motion error is low, for example at 5 percent 
motion error, most of the points in user track would have distance error under 1.0 meters. On the other hand, 
when motion error is large, the range also increases and even reaching over 4.0 meters of distance error at 
30% motion error for both machine learning models. 
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Figure 15. Plot of CDF of distance error of points in user track when applying proposed method using 
different machine learning models. (a) SVR, (b) KNN 


To study the effect of angle and distance error separately, 20 independent runs were done using the 
same procedure and configuration as mentioned but with angle error and distance error separately. The result 
with only angle error is shown in Figure 16(a) and distance error only is shown in Figure 16(b). Looking at 
Figure 16(a), a similar pattern can be seen as in Figure 14. When the angle error ranges from 0 to 15 percent, 
KNN performs better and achieves a lower average error. But as the motion error gradually increases, the 
difference in performance for SVR and KNN is insignificant. 

The average error when only distance error is added is lower than when the only angle is added as 
the results show in Figure 16(b). It is clear that the average error increases gradually when more and more 
distance error is put in thus the fluctuation between levels of distance error is very small. From the 
observation, angle error is the source of motion error which contributes more to the average error. 
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Figure 16. Plot of average error with respect to angle and distance error seperately of the proposed method 
and conventional fingerprinting method, (a) Angle error, (b) Distance error 


4.5. Results with different number of anchor points 

Because the proposed method generates user track coordinate by measuring how well the machine 
learning model performs on anchor dataset K, it’s important to show how much performance is affected 
when the number of anchor points in dataset K changes. Using the same method to add distance and angle 
error and all the same configurations from the previous section, the number of anchor points is lowered from 
8 to 6 and 4 and measured the difference in performance. The results of the proposed method using SVR are 
illustrated in Figure 17(a) and the one using KNN is illustrated in Figure 17(b). 
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Figure 17. Plot of average error with respect to distance and angle error of the proposed method using 
different machine learning model and different number of anchor points. (a) SVR, (b) KNN 


For both cases of SVR and KNN, it is obvious that the number of anchor points would greatly affect 
the overall performance of the proposed method. In the case of SVR, there is a big difference in terms of 
average error between using 8 anchor points and using 4 anchor points. As for KNN, the trend is similar to 
SVR, however, the difference between using 6 and 8 anchor points is insignificant. Detailed results are 
shown in Table 4. 

From the results, even though the proposed method requires a small number of RSSI measurement 
points with known coordinates, it is ideal to collect a reasonable number of data points for the proposed 
method to perform well. Because when the number of data points in dataset K is set too low, there is not 
enough information for the machine learning models to perform a correct validation. Using more data points 
would certainly increase the performance of the proposed method but it would require more effort in 
collecting data. In real cases, the number of data points in dataset K can be adjusted considering several 
factors like how large the environment is, how dense the AP network and how the proposed method currently 
performs to satisfy the requirements. 
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o 


Average distance error (m) 


Mode! PiptionenOn Ge) 6 anchor points 4 anchor points 
SVR 0 1.497 2.039 
5 1.516 2.053 
10 1.587 2.159 
15 1.584 2.215 
20 1.746 2.303 
25 1.870 2.392 
30 1.921 2.496 
KNN 0 0.876 1.637 
5 0.943 1.616 
10 0.971 1.644 
15 1.080 1.776 
20 1.167 1.882 
25 1.444 1.904 
30 1.612 2.086 


343 


Comparison of CDF of distance error of points in user track obtained by our method across different 
number of anchors and motion error are presented in Figures 18(a) to 18(d). Results in Figure 18(a) and 
Figure 18(b) can be compared with the results in Figure 15(a) and Figure 15(b). When the number of anchor 
points decreases many distribution functions of distance error of RSSI measurement points also tend to shift 
to the right. In other words, they would have more points than containing a higher error. This is expected to 
happen because it has been discussed earlier how the performance of the proposed method is affected by the 


number of anchor points. 
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Figure 18. Plot of CDF of distance error of points in user track when applying proposed method using 
different machine learning models and number of anchor points, (a) SVR with 6 anchors, 
(b) KNN with 6 anchors, (c) SVR with 4 anchors, (d) KNN with 4 anchors 


——- 10% motion error —*— 25% motion error 


—s— 15% motion error —*— 30% motion error 


The same trend can be seen in Figure 18(c) and Figure 18(d) as the distribution covers a higher 
range, and the distance error is higher. Compared to fingerprinting approach in terms of data collection, only 
8 data points are needed to make an accurate estimation. This makes the IP system that implements the 
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proposed method to be flexible and could adjust, compensate for the error easily. If the environment has 
changes, only a few new data points need to be collected and then the accuracy can be adjusted by having 
more or fewer anchor points in the environment. 


5. CONCLUSIONS 

This paper introduces a new method to estimate indoor target trajectory location by combining 
Fingerprinting and PDR methods using a genetic algorithm. Experimentation and simulation results have 
shown that the proposed method can achieve high localization accuracy even with a limited number of 
training points with known coordinate and noisy motion instruments. From the indoor positioning 
perspective, the method would reduce a lot of effort in collecting and maintaining the dataset, which saves a 
lot of time and resources. In addition, no infrastructure needed to be installed as we take advantage of IMU 
sensors available in mobile devices and Wi-Fi networks. The main limitation of the proposed method is that 
using a genetic algorithm and machine learning models would increase the time complexity if parameters like 
the number of individuals in a population and the number of generations are set too high. By tuning 
parameters, the genetic algorithm can run faster because results have shown that GA converges fast. But still, 
mobile device hardware capability is limited, thus running such algorithms would be unsuitable. A better 
solution is to use a centralized server that is similar to other approaches like fingerprinting or interpolation 
and extrapolation so that more processing power can be harnessed and workload is reduced for client devices. 
In the future, further experimentation on real scenarios should be considered to validate the proposed method 
in different environments and configurations. In addition, studies on how to reduce the computational time 
would enable the proposed method to be used in real-time tracking applications. Besides traditional genetic 
algorithms, several algorithms in the field of evolutionary computation can be used for example, ant colony 
or particle swarm optimization. Thus, it would open many opportunities for research to apply the 
evolutionary algorithm to the field of indoor positioning. 
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